An Introduction Into Python¶

Table of contents:

Environment Setup
Why Python?
Syntax
Variables and Datatypes
Operators
Collection Data-Types
The if statement
For Loops
Nested Loops
While Loops
continue and break

Environment Setup¶

The most recent major version of Python is Python 3, which we shall be using in this tutorial. However, Python 2, although not being updated with anything other than security updates, is still quite popular.

The Official Python Website

Download Python

With over 6 million users, the open source Anaconda Distribution is the fastest and easiest way to do Python and R data science and machine learning on Linux, Windows, and Mac OS X. It's the industry standard for developing, testing, and training on a single machine.

Anaconda gives you preinstalled tools such as Jupyter Notebooks, Spyder and RStudio. It is the one thing you will need to set up your development environment.

Please follow this tutorial to install Anaconda : Installing Anaconda on Windows

Why Python?¶

Python is an easy to learn, easy to use programing language.
It is intuitive, a lot of the syntax looks like English Language.
No data type declaration for a variable.
Support for inbuilt structures like Lists and Dictionaries make it great to work with.
Most ML libraries are available in Python.
More reasons will become clearer as you learn about Python.

Suppose we were to write a program to read 10 strings, and merge the ones that started with a "Hi". How big would the program be?

In [1]:

inp_list = [input() for i in range(10)]
inp_list_2 = [i for i in inp_list if str.lower(i[:2])=="hi"]
print("".join(inp_list_2))

Hey
Hi
Hiya
Hira
h
f
s
w
e
r
HiHiyaHira

See how easy this is? Note that the syntax is pretty intuitive as well.

Don't worry if you can't understand anything here, soon you will be able to follow along as well. Let's dive into the basics.

How To Launch Python¶

We can launch Python by opening terminal and typing "python3". This opens up an interactive terminal, where you can type the commands.
Another option is to open up a text editor, type the program, save the file with a '.py' extension (instead of .docx or .txt). The name of the file will be prog_name.py, so to execute it, we go to terminal and type "python3 prog_name.py".

Errors¶

Errors are key to identifying the mistakes in your program. They will show up for various reasons like syntax errors and logical errors.

In languages like C, errors in syntax are brought up whether or not those statements are executed (conditional statements). In Python, unless a particular block of code is executed, even syntax errors will not show up for that block of code.

This may seem counterproductive, but as you will later see, it is the basis for how Python functions.

Let's Begin!¶

Syntax¶

Syntax is nothing but the format you need to follow. They are the rules you need to adhere to.

The legendary Hello World is here.

In [2]:

print("Hello World!")

Hello World!

To print something, we write print followed by the thing we want to.

We enclose strings (i.e. text) in quotes. In python, we can use single quotes (' '), double quotes(" ") or triple quotes(''' ''') to do so. Triple quotes is an exclusive Python added functionality.

Python is Case-Sensitive.

In [3]:

a = 5
A = 10
print(a)
print(A)

5
10

Did you notice - There's no semi-colon! Python statements do not require an ending semi-colon.

P.S.: Semicolons can be used though, if you have multiple statements on one line.

In [4]:

a=6;A=10

In [5]:

print(a)
print(A)

6
10

What are variables? These are containers to store temporary values. Anything you need to store, so that you can access it later in the program, is done by using a variable.

In the above code segment, a and A are variables.

A few syntax rules for variables are listed below:

Variables names must start with a letter or an underscore
The remainder of your variable name may consist of letters, numbers and underscores
Variable names cannot be keywords.
As mentioned earlier, Python is case sensitive.

What are keywords?¶

Words that have a predefined meaning in the language, like print, input, list, str, int, and a lot more. As and when you find out more about the language, you will know which words add to this list.

Other important syntax rules will be covered as and when we encounter the relevant concepts.

Variables and Datatypes¶

As we saw earlier, a variable is used to store a value in memory. Just like we need to remember things before an exam in order to be able to answer questions, a computer needs to store values in order to work with them.

The concept of datatypes is crucial to programming.

What is a datatype?

It is used by languages to ensure that operations on these values are legitimate. And also because adding 2 integers is different to adding 2 strings. The process is different altogether.

What is a legitimate operation?

Consider you want to add 2 things. The only things that we consider adding are numbers, right? We can't add a character and a number. So, to ensure that what things we want to do actually make sense, programming languages use the concept of datatypes. Thus errors are detected easily.

Most languages require you to declare what kind of value will a given variable store.

The kind of values that can be stored can be categorized into these broad headings:

Numerical
Textual
Boolean (True/False)
There sometimes may be further categorization as well.

The most annoying thing that other languages like C have is datatype declaration in the code for a variable. Once a variable is of a particular type, it can't be changed to store another type. Python does not require this!

Does this mean the problem of checking datatypes does not exist in Python? No. Instead, Python does this checking just before execution.

How?

Python does not only store the variable value, but it also stores the type of the value stored with it.

How data is stored

Thus, we generally do NOT need to care about variable type, as we can reassign a variable which held an integer to now hold a string.

Now, for some examples on using variables.

Tell me which of the following are valid variable names:

my_var_1 = 10
my-var-1 = 10
print = 200

The first one is right.

The second one uses a '-', which is not allowed.

The third one tries to assign a value to a keyword, which is also not allowed.

Let's dive into Datatypes now, as it will help give you some clarity into what is all this Integer Float Double and whatnot.

The type() function gives us the datatype of a variable. We will look at functions in the next lecture.¶

In [6]:

an_int = 10

In [7]:

type(an_int)

Out[7]:

int

An integer is used to store basic integer values (the mathematical set of integers, I/Z).

In [8]:

a_float = 10.0

In [9]:

type(a_float)

Out[9]:

float

A float is used to store decimal values(also called floating point numbers).

In [10]:

an_str = 'a'

In [11]:

type(an_str)

Out[11]:

str

A string can store text. It can consist of anything, we will look at them in depth later.

To take input from the user, we use input(). This returns the user input in the form of a string, we can store it in a variable.

In [12]:

print("Enter something")
inp = input()

Enter something

In [13]:

type(inp)

Out[13]:

str

In [14]:

a_complex = 1+3j

In [15]:

type(a_complex)

Out[15]:

complex

Python provides in-built support for the mathematical concept of a complex number.

The format is a+bj, where a and b are real values and j is a character which is essential and non-replacable. It differentiates a complex number from a real number or from looking like an addition operation.

Complex Number 2-D Space

If we want to store an imaginary number that lies on the real axis, we can use code like so:

a_complex = 2+0j

In [16]:

a_boolean = True

In [17]:

type(a_boolean)

Out[17]:

bool

A boolean is the most basic of datatypes, which only has 2 values: True or False.

A very key concept in Python is that we can always use the same variable to store different types. Also, we may not know what type of data will the variable store, and the program will run smoothly unless you try to perform an illegitimate operation like add an integer and a string.

Now, let's move on to operators.

Operators¶

The base of programming lies in performing mathematical operations by using operators. We'll go over them one-by-one.

Arithmetic Operator
Relational Operator
Assignment Operator
Logical Operator
Membership Operator
Identity Operator
Bitwise Operator

Arithmetic Operators are used to perform arithmetic operations on variables. The major ones are listed below:

The Basic Ones:

Addition = 3 + 4
Subtraction = 3 - 4
Multiplication = 3 * 4
Division = 3 / 4
Modulo = 3 % 4 #This gives the remainder of division

To perform $$x^y$$Python provides an operator.

In [18]:

3**4

Out[18]:

We can also perform division and truncate the part after the decimal to get an integer result instead of a float (NOT rounding off) using 1 operator.

In [19]:

3//4

Out[19]:

Relational Operators are used to perform comparisons, they always return boolean values.

The Basic Ones:

Lesser than = 3 < 4
Lesser than or equal to = 3 <= 4
Greater than = 3 > 4
greater than or equal to = 3 >= 4

To check for equality, we use ==

In [20]:

3==4

Out[20]:

False

Note: a=b is an assignment operation, while a==b is to check wheter a is equal to b

To check for inequality, we use !=

In [21]:

3!=4

Out[21]:

True

Assignment Operators and Shorthand Notation

The statement

a = 4

assigns a with the value 4. a is now a variable.

a+=b is a shorthand for a = a + b.

Similarly, a-=b, a=b, a/=b, a*=b,a%=b,a//=b exist.

In [22]:

a = 5
a+=4
print(a)

In [23]:

a = 5
a-=4
print(a)

In [24]:

a = 5
a*=4
print(a)

In [25]:

a = 5
a/=4
print(a)

1.25

In [26]:

a = 5
a%=4
print(a)

In [27]:

a = 5
a**=4
print(a)

In [28]:

a = 5
a//=4
print(a)

Logical Operators are used on boolean values. They are used to operate boolean logic.

In [29]:

True and False

Out[29]:

False

In [30]:

True or False

Out[30]:

True

In [31]:

not True

Out[31]:

False

Membership Operator will be covered in the next lecture.

Identity Operator will be covered later.

Bitwise Operators are used to perform binary operations on numeric values.

Numbers are stored in binary format, hence bitwise operations can sometimes be useful.

& performs bit by bit AND operation of 2 numbers.

In [32]:

0&1

Out[32]:

| performs bit by bit OR operation of 2 numbers.

In [33]:

0|1

Out[33]:

^ performs bit by bit XOR operation of 2 numbers.

In [34]:

0^1

Out[34]:

~ performs bit by bit 1's complement operation of a number.

In [35]:

~1

Out[35]:

-2

<< performs a left shift operation on a number.

In [36]:

4<<2

Out[36]:

>> performs a right shift operation on a number.

In [37]:

4>>2

Out[37]:

Now, we move on to a few advanced Python data structures, which give Python the power it has.

Collection Data-Types¶

So far we have seen built-in types like: int, float and bool. int, float, and bool are considered to be simple or primitive data types because their values are not composed of any smaller parts. They cannot be broken down.

On the other hand, strings, lists, sets, dictionaries and tuples are different from the others because they are made up of smaller pieces. In the case of strings, they are made up of smaller strings each containing one character.

Types that are comprised of smaller pieces are called collection data types. Depending on what we are doing, we may want to treat a collection data type as a single entity (the whole), or we may want to access its parts. This ambiguity is useful.

Strings¶

Strings are a collection of characters. They can store a line of text. Strings are arrays of bytes representing Unicode characters.

Interestingly enough, Python does not have a character datatype. It stores characters as a string of length 1.

Strings can be created by using:

Single Quotes(' ')
Double Quotes(" ")
Triple Quotes(''' ''')

Why are there 3 ways? To answer that, let's look at another question: If we want to store a sentence containing single quotes and/or double quotes, how do we do that?

The answer is we can use triple quotes to define the start and end of the string.

But what is we want to use triple quotes as well?

Thus we have something called an Escape sequence.

For characters that have special meaning in a string, we can use an escape sequence, to render that character in the string.

Didn't get a word I'm saying? I'll give you an example.

In [38]:

str_1 = "Hi! 'I'm Here"

This is valid

In [39]:

str_2 = ''' Hi! "Hello I'm here ''' 

This is valid as well.

In [40]:

str_3 = ''' Hello! I'm "Happy!" These are triple ''' quotes''' 

  File "<ipython-input-40-c9ff74d270a6>", line 1
    str_3 = ''' Hello! I'm "Happy!" These are triple ''' quotes'''
                                                              ^
SyntaxError: invalid syntax

This is not valid.

In [41]:

str_3 = 'Hello! I\'m "Happy!" These are triple \'\'\' quotes'

In [42]:

print(str_3)

Hello! I'm "Happy!" These are triple ''' quotes

Here, notice how single quotes preceded by a backslash(\) are rendered as a single quote and do not cause the string to end prematurely.

A few escape characters are mentioned below:

Single Quote: '
Double Quote: "
Backslash: \\

Indexing a String¶

Since a string is a collection of characters, we can access characters individually as well from a string. To do so, we use the following syntax:

str_1 = "Hello"
print(str_1[1])

This will output 'e'.

Indexes in Python start at 0. Thus, for str_1,

H is stored at index 0
e is stored at index 1
l is stored at index 2
l is stored at index 3
o is stored at index 4

Index of a String

Python supports negative indexing, that is the last character is stored at index -1, second last at -2 and so on.

In [43]:

str_1 = "Hey! Hello! Hi!"

In [44]:

str_1 = "Hello"

The value inside the square brackets is the index whose value you want.

In [45]:

print(str_1[1])

In [46]:

print(str_1[-1])

We can get multiple items from a string (slicing) by using a semi-colon. The emi-colon can be thought of as meaning 'everything'.

If it is preceded by an integer, it means everything from the element at that index.

str_1[2:]

If it is followed by an integer, it means everything upto but not including the element at that index.

str_1[:4]

If it has both the above, it returns a slice of the string, starting at the element at the index preceding it, until the index following it.

str_1[2:4]

In [47]:

str_1[2:]

Out[47]:

'llo'

In [48]:

str_1[:5]

Out[48]:

'Hello'

Lists¶

The array substitute in Python with some added functionality is a List. A list is a type of container, which is used to store multiple data items at the same time. It is ordered (i.e. it can be indexed).

List Image

In [49]:

lst = [2,3,'a','b',5.6,True, 1+3j]

In [50]:

type(lst)

Out[50]:

list

In [51]:

print(lst)

[2, 3, 'a', 'b', 5.6, True, (1+3j)]

It can also contain another list within itself. This is how higher dimensional arrays can be stored.

Indexing is the same in lists as it is in strings. For a 2-D array (list of lists), we have to use indexing as shown below.

In [52]:

twod_arr = [[0,1],[2,3]]
twod_arr[0][0]

Out[52]:

Dictionaries¶

A dictionary stores pairs of key-value pairs. For those of you who know hashing, dictionaries are nothing but an implementation of a hashing function.

Basically, in lists, we used default indexes(0,1,2,etc). Here, we can define custom indices. They are known as keys. They can then store corresponding values. They are implemented using hash functions.

Dictionary Vs. List

In [53]:

my_dict = {'b':'beauty','j':"joy"}

In [54]:

my_dict['b']

Out[54]:

'beauty'

Thus, we can now have strings act as indices too!

Sets¶

It is a collection of unordered items. It follows the properties of a mathematical set, where no duplicate elements are stored.

In [55]:

my_set = {"DS","AI","DL"}

In [56]:

print(my_set)

{'DS', 'DL', 'AI'}

In [57]:

my_set_2 = {'a','a','b','b'}

In [58]:

print(my_set_2)

{'a', 'b'}

Tuples¶

A tuple is a list, which cannot be changed/modified. Tuples are ordered, just like lists.

In [59]:

my_tuple = ("a","b","c")

In [60]:

my_list = ["a","b","c"]

In [61]:

my_tuple[0] = 'd'

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-61-3b4380fe39e6> in <module>()
----> 1 my_tuple[0] = 'd'

TypeError: 'tuple' object does not support item assignment

In [62]:

my_list[0] = 'd'

All datatypes have associated built-in methods which help in performing basic tasks on them. We will look at them once we look at functions, and understand objects.

The if statement¶

This is used to decide if a statement / a group of statements should be executed, based on a condition.

In [63]:

a = 5
b = 6
if a<b:
    print(a)
    print("Yes!")

5
Yes!

Observe a few things:

The syntax is

if <expression>:
    <statements>

The statements after 'if' are executed if the expression evaluates to True.
Notice that we do not have a bracket to denote the part of code which is to be executed if the condition is true.
Instead, we use indentation to denote that block of statements. A constant indent tells Python that those statements belong to one code block.

For a complicated condition(using logical operators and/or expressions), we enclose all of it in a bracket.

Now, we can also extend this to execute another block of statements if the condition evaluates to False.

In [64]:

a = 5
b = 6
if a>b:
    print(a)
    print("Yes!")
else:
    print(b)
    print("No!")

6
No!

Thus, the keywords 'if' and 'else' are used to execute statements based on a condition.

What do you do if you want to have multiple condition based statement execution?

For example, if a person is considered:

Short if his height is less than 4 feet.
Average if his height is between 4 and 5 feet.
Tall if his height is more than 5 feet.

We need another keyword here to make our lives easier, called 'elif'.

In [65]:

height = 4.5
if height<4:
    print("Short")
elif height<5:
    print("Average")
else:
    print("Tall")

Average

The importance of indentation is paramount in Python. It replaces the cumbersome curly brackets that exist in C, and which if you forget to close, can lead to so many problems.

Indentation makes it easy to trace which code block does which part of code belong to. When you have nested loops (explained later), it aids in visual understanding.

For Loops¶

A for loop is used to repeat a block of instructions/statements multiple number of times(most common use case for beginners). In Python, it is used to iterate over a sequence of items as well (lists, sets, tuples and even strings).

In [66]:

for i in range(10):
    print(i)

Here, the variable following for (in this case, i) is assigned values from the sequence of items serially which is referred to after the 'in' keyword. The range() function generates numbers from 0 to the number passed to it. This will become clearer when we study functions. For now, this is standard syntax when we want to generate numbers from 0 to n.

Thus, the variable is assigned values from the sequence mentioned. Another example is given below:

In [67]:

my_list = [1,2,'s','errve',0.4, True]
for i in my_list:
    print(i)

1
2
s
errve
0.4
True

We can loop through a string as well.

In [68]:

for i in "Hey there!":
    print(i)

H
e
y
 
t
h
e
r
e
!

In short, we can loop through any collective datatype, fetching every item from the datatype and assigning it to the variable mentioned after the 'for' keyword.

Here, let us have a look at Nested Loops, which are extremely powerful.

Nested Loops¶

When we have a loop within a loop, we have Nested Loops. These can be used to generate various patterns.

In [69]:

for i in range(3):
    for j in range(3):
        print(i,j)
    print()

In [70]:

for i in range(5):
    for j in range(i+1):
        print('*',end='')
    print()

*
**
***
****
*****

range(n) goes from 0 to n-1

Thus, using nested loops, you can set a variable inner loop dependent on the outer loop.

Also notice how indentation now makes the code extremely readable.

While Loops¶

Here, we can execute a set of statements as long as a condition is true.

In [71]:

a = 10
while(a>0):
    print(a)
    a-=1

Thus, in Python, for loop can iterate over a collective datatype, and a while loop executes until a statement evaluates to False.

continue and break¶

The continue statement is used in loops to skip all the following statements in the current code block, and proceed with the next iteration.

In [72]:

a = 10
while(a>0):
    a-=1
    if(a<5 and a>1):
        continue
    print(a)

The break statement on the other hand, exits the loop being executed.

In [73]:

a = 10
while(a>0):
    a-=1
    if(a<5 and a>1):
        break
    print(a)

Time for a few exercises:

Write a program that finds the units place digit of a number.
Write a program that prints the following pattern:

****

*****

******